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DEVICE AND METHOD FOR VOICE ACTIVITY DETECTION 

Field of the invention 

The present invention relates to a device, a mobile apparatus incorporating 
the device, an accessory therefor and a method for voice activity detection, 
particularly in a mobile telephone, using the directional sensitivity of a microphone 
system and exploiting the knowledge about the voice source's orientation in space. 
The device assists the existing voice activity detection to achieve higher sensitivity 
and requiring less processor power. 

State of the art 

Voice activity detectors are used e.g. in mobile phones to enhance the 
perfomiance in certain situations. The most common way to construct a voice 
activity detector is to look at the levels of the sub-bands of the incoming signal. 
IS Then the background noise level and the speech level are estimated and compared 
with a threshold to determine whether speech is present or not An example of a 
voice activity detector is disclosed in U.S. patent 6,427,134. 

For instance in noisy environments it is hard to make a uniform parameter 
set-iq> for the voice activity detector. Therefore several voice activity detectors are 
20 needed, trimmed to the specific cases. For example in some modules you need to be 
sure that if there is speech it should be detected (echo canceller), but in other cases 
it is better to indicate no speech if the signal to noise ratio level is too low. The 
plurality of voice activity detectors put a load on the digital signal processors that 
have to take care of performing the various voice activity detection algorithms. 

25 

Summarv of the invention 

An object of the present invention is to complement existing voice activity 
detection taking into account the direction of tbe source of the sound. 

In a first aspect, the invention provides a device for voice activity detection 
30 comprisuig a sound signal analyser arranged to determine whether a sound signal 
comprises speech. 

According to the invention, the device fiirther comprises 
a microphone system arranged to discriminate sounds emanating from sources 
located in different directions from the microphone system, so that sounds only 
35 emanating from a range of directions are included as signals possibly containing 
speech. 

Suitably, the range of directions is directed in the direction of an intended 
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elements separated a distance and located on a line directed in the direction of an 
intraded user's mouth. 

The range of directions may be defined as all sounds falling inside a cone 
with a cone angle co, wherein 10''<a<30°, and preferably, a is approximately 25°. 
5 In another embodiment, the microphone system comprises three microphone 

elements separated a distance and located in a plane directed in the direction of an 
intraded user's mouth. 

Suitably, two of said three microphone elements are separated a distance and 
located on a line directed p^endicular to the direction of an intended user's mouth. 
10 In another embodiment, the microphone system comprises four microphone 

clients located such that the fourth microphone is not located in the same plane as 
the three others. 

The microphone elements may be directional with a pattern having maximal 
sensitivity in the direction of an intended user's mouth. 

IS In still a further embodim^t, the microphone system comprises one 

directional microphone element together with one or more other microphone 
elements to remove the uncertainty in the direction of the sound source. The 
directional microphone element may be used to measure the sound pressure level 
relative to the other microphone element 

20 In a second aspect, the invention provides a mobile apparatus comprising a 

device as mentioned above. 

Suitably, the microphone elements are located at the lower edge of the 

apparatus. 

In one embodiment, a pluraUty of microphone elements are located at the 
25 lower edge of the apparatus and at least one further microphone element is located 
at a distance from the lower edge. 

The mobile apparatus may be a mobile radio terminal, e.g. a mobile 
telephone, a pager, a communicator, an electric organiser or a smartphone. 

Jn a third aspect, the invention provides an accessory for a mobile apparatus 
30 comprising a microphone system as mentioned above. 

Suitably, the direction of the range of directions is adjustable. 

The accessory may be a hands-free kit or a telephone conference 
microphone. 

In a fourth aspect, the invention provides a method for voice activity 
35 detection^ including the steps of: 

receiving sound signals from a microphone system arranged to discriminate sounds 
emanating from sources located in different directions from the microphone system; 
determining the direction of the sound source causing the sound signals; 
if the sounds emanate from a first range of directions, furth^ analyse 
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the sound to detennine whether the sound signal comprises speech; 

but if the sounds emanate from a second, different range of directions decide that 

the sound signal does not comprise speech. 

Suitably, the first range of directions is directed in the direction of an 
5 intended user's mouth. 

The first range of directions may be defined as all sounds falling inside cone 
with a cone angle ol, wherein 10''<a<30°, and preferably a is approximately 25 

In one embodiment, the microphone system comprises at least two 
microphone elements located at a distance fi^om each other and located on a line 
10 directed in the direction of an intended user's mouth, said two microphone elements 
being separated a distance d, wherem the direction to the sound source 6 is 
calculated as 

0 =arccos 

2d 

whwe 

15 At is the time difference between the sounds fix)m the two microphone elements, 
V is the velocity of sound. 

In another embodiment, one directional microphone element is used together 
with one or more other microphone elements to remove the uncertainty in the 
direction of the sound source. 
20 The directional microphone element may be used to measure Ifae sound 

pressure level relative to tiie other microphone element 

The invention is defined in the attached independent claims 1, 12, 16, and 20, 
while preferred embodiments are set forth in the dependent claims. 

25 Brief description of the drawings 

The invention will be described below in greater detail with reference to the 
accompanying drawings, in which: 

fig. 1 is a perspective view of a mobile phone incorporating the present 
invention, and 

30 fig. 2 is a schematic drawing of the receiving angle of an embodiment of the 

present invention. 

Detailed description of preferred embodiments 

As mentioned briefly in the introduction, many signal processing algorithms, 
35 such as echo cancellation and background noise synthesis, used in phones and 
hands-firee kits are based on the fact that the user is speaking or not. For example 

the sneech codec 155 active when the near-Rnd iisRr is ssnpialnna anri fh^. hanXrormmA 
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voice activity detectors (VAD) to perform well. An error in the detection can result 
in artefacts or malfunctions caused by divergence of the algorithms or other 
problems. 

Existing voice activity detectors are directed to deteraiine whether speech is 
5 present or not in a sound signal. However, in fact not all speech is interesting or 
relevant, but only the user's speech. All other speech, e.g. in a noisy environment 
with several persons speaking, could be ignored and regarded as just noise. 

The present inventor has realised that a microphone system having some 
kind of directional sensitivity could be used to discriminate sound emanating from 
10 different sources located in different directions. Sound not emanating firom the us^ 
can be declared as non-speech, and those signals do not have to be analysed with 
the conventional voice activity detectors. 

The existing voice activity detectors may be conventional and are only 
referred to as a sound signal analyser in this application. 
IS Generally, a microphone system having some kind of directional sensitivity 

can be used Fig. 1 shows an example with at least two separate microphone 
elements. 

A general mobile telephone is indicated at 1. The invention is equally 
applicable to other devices such as mobile radio terminals, pagers, communicators, 

20 electric organisers or smartphones. The common feature is that voice activity 
detection is employed, e.g. in connection with communicating speech or receiving 
voice commands by means of speech recognition. 

In the siniplest version, the miax>phone syst^ compises two microphones 
2a and 2b. Suitably, they are located on a line directed in the calculated direction of 

25 an intended user's mouth. Suitably, the microphone elements are located at the 
lower edge of the mobile apparatus 1 . 

Fig. 2 shows a schematic diagram of the calculation of the direction of the 
sound source, typically the user's mouth 3. In the case of two microphones, only the 
angle to the line on which the microphone elements are located can be determined. 

30 In other words, the direction of the sound source is on a cone vidth a cone angle 6. 
To calculate the angle 6, first a cross-correlation between the two signals from the 
microphones 2a and 2b is made. The maximum indicates the time difference At 
between the two microphones 2a and 2b. The distance between the two 
microphones 2a and 2b is e.g. 20 millimetres. The angle 6 is calculated as 
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Note that arccos is only defined for arguments between -1 and 1 . If the time 
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difference is negative, this means that the angle is greater than 90° and the sound 
emanates from behind the apparatus. 

Suitably, the device is adapted to determine ^t all sounds with an angle 6 
less than a iixed angle tt are emanating from the user. The threshold angle a may be 
5 set within a range of e.g. 10° to 30°, suitably at 25°. 

In the case of three microphones, the direction of the sound source can be 
finther determined to be at two points (e.g. on the above cone). The three 
microphone elements are suitably located in a plane directed in the general direction 
of the user's mouth. In fig. 1 microphone elements 2b, 2c and 2d are a possible set- 
10 up- The two microphone elraients 2c and 2d at the front are located on a line 
pCTpendicular to the direction of the user's mouth, while the third microphone 
element 2b is located at the rear side. 

In the case of four microphones (or more) detection of all direction angles 
may be calculated, provided that four microphone elements are located such that the 
IS fourth microphone is not located in the same plane as the lluree others, e.g. on a 
tetrahedroiL A possible set-up is two microphone elements 2c and 2d at the front on 
the lower edge, while a third microphone element 2b is located at the rear side, and 
a fourth microphone element 2e is located at the front at a distance from the lower 
edge. 

20 A similar microphone arrangement may be used in an accessory to a mobile 

apparatus, such as a hands-free kit or a telephone conference microphone system 
intended to be placed on a table. Apart from the microphone elements the logic 
circuitry may be located in the main/mobile apparatus. In this case the recqption 
angle of the microphone system can be adjustable. This is useftil e.g. when the 

25 microphone system is placed in a car, where the user can be seated either in the 
driver's seat or in the passenger's seat or even both the driver and the pass^ger 
may be speakers during the same call. The adjustment of the reception angle can be 
achieved mechanically or electronically, for example by beam forming or 
adaptation of the directional sensitivity of the microphone systeixL 

30 To fruther enhance the sensitivity of the microphone system, directional 

microphone elements with a pattern having a maximum sensitivity in the direction 
of the user's mouth could be used 

In a frirther embodiment, one directional microphone element is used 
together with one or two other microphone elements (that may be non-directional). 

35 The directional microphone element is used to measure the sound pressure level 
relative to the other(s), thus removing the uncertainty in the direction of the sound 
source. Various combinations of directional microphone elements and non- 
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performance. With the present invention only one voice activity detector may be 
necessary throughout Ihe whole signal path. This will in turn reduce the 
computational conq)l6xity, decreasing the load on the digital signal processors as 
well as improving the performance. It is especially favourable in enviromnents with 
5 high background noise and noise with similar spectral properties as speech. 

A person skilled in the art will realise that the invention may be realised with 
various combinations of hardware and software. The scope of the invention is only 
limited by the claims below. 



